A Novel Approach for Protein Classification Using Fourier Transform
نویسندگان
چکیده
Discovering new biological knowledge from the highthroughput biological data is a major challenge to bioinformatics today. To address this challenge, we developed a new approach for protein classification. Proteins that are evolutionarilyand thereby functionallyrelated are said to belong to the same classification. Identifying protein classification is of fundamental importance to document the diversity of the known protein universe. It also provides a means to determine the functional roles of newly discovered protein sequences. Our goal is to predict the functional classification of novel protein sequences based on a set of features extracted from each protein sequence. The proposed technique used datasets extracted from the Structural Classification of Proteins (SCOP) database. A set of spectral domain features based on Fast Fourier Transform (FFT) is used. The proposed classifier uses multilayer back propagation (MLBP) neural network for protein classification. The maximum classification accuracy is about 91% when applying the classifier to the full four levels of the SCOP database. However, it reaches a maximum of 96% when limiting the classification to the family level. The classification results reveal that spectral domain contains information that can be used for classification with high accuracy. In addition, the results emphasize that sequence similarity measures are of great importance especially at the family level. Keywords—Bioinformatics, Artificial Neural Networks, Protein Sequence Analysis, Feature Extraction.
منابع مشابه
Formation interface detection using Gamma Ray log: A novel approach
There are two methods for identifying formation interface in oil wells: core analysis, which is a precise approach but costly and time consuming, and well logs analysis, which petrophysists perform, which is subjective and not completely reliable. In this paper, a novel coupled method was proposed to detect the formation interfaces using GR logs. Second approximation level (a2) of GR log gained...
متن کاملOn The Simulation of Partial Differential Equations Using the Hybrid of Fourier Transform and Homotopy Perturbation Method
In the present work, a hybrid of Fourier transform and homotopy perturbation method is developed for solving the non-homogeneous partial differential equations with variable coefficients. The Fourier transform is employed with combination of homotopy perturbation method (HPM), the so called Fourier transform homotopy perturbation method (FTHPM) to solve the partial differential equations. The c...
متن کاملSolid Dispersion Approach Improving Dissolution Rate of Stiripentol: a Novel Antiepileptic Drug
Some drugs have low bioavailability due to their poor aqueous solubility and/or slowdissolution rate in biological fluids. Stiripentol (STP) is a novel anticonvulsant drug that isstructurally unrelated to the currently available antiepileptics. It has poor aqueous solubilityand its solubility has to be enhanced accordingly. Polyethyleneglycol 6000 (PEG-6000) iscommonly utilized as a hydrophilic...
متن کاملSolid Dispersion Approach Improving Dissolution Rate of Stiripentol: a Novel Antiepileptic Drug
Some drugs have low bioavailability due to their poor aqueous solubility and/or slowdissolution rate in biological fluids. Stiripentol (STP) is a novel anticonvulsant drug that isstructurally unrelated to the currently available antiepileptics. It has poor aqueous solubilityand its solubility has to be enhanced accordingly. Polyethyleneglycol 6000 (PEG-6000) iscommonly utilized as a hydrophilic...
متن کاملIdentification and Quantification of Texture Soy Protein in A Mixture with Beef Meat Using ATR-FTIR Spectroscopy in Combination with Chemometric Methods
Meat, as an important source of protein, is one of the main parts of many people’s diet. Due toeconomic interests and thereupon adulteration, there are special concerns on its accurate labeling.In this study Fourier transform infrared (ATR-FTIR) spectroscopy combined with chemometrictechniques (principal component analysis (PCA), artificial neural networks (ANNs), and partial<...
متن کامل